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Abstract 

We consider a class of convergence questions for infinite products that arise in 
wavelet theory when the wavelet filters are more singular than is traditionally built 
into the assumptions. We establish pointwise convergence properties for the absolute 
square of the scaling functions. Our proofs are based on probabilistic tools. 
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1 Introduction. Wavelets 



We will be primarily interested in the infinite products that typically occur in 
wavelet analysis [12, Ch. 5]; see also [7]. As is well known, the traditional ap- 
proach to wavelet- scaling functions is based on analysis of wavelet niters which 
are periodic functions in one or more variables, and which have some degree of 
regularity (say, Lipschitz at low frequencies). In addition, the filter functions 
must satisfy a variant of the following two features: (i) some quadrature con- 
dition, and (ii) some low-pass condition. But conditions (i)-(ii) are known to 
be incompatible with several new classes of examples. For these wider classes 
of wavelets, we show that a new uncertainty principle for wavelet filters is 
at play. Reflecting this uncertainty principle for wavelet filters, we note that 
recent papers on wavelet sets, and on other frequency-localized wavelets (see, 
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e.g., [3,4,10,11]), necessitate the consideration of wavelet filters which are de- 
fined only a.e., and for which conditions (i)-(ii) are more subtle. As a result, 
the standard deterministic methods then do not immediately apply. 

In this paper, we use a random walk model (see [13,44]) in a new analysis 
of wavelets, but for a class of wavelets where the filters are more singular 
than is the case in the standard references [12,20,26,29,38,39,46]. Our results 
apply also to iterative algorithms for wavelet packets [47,48]. In these problems 
solutions are constructed from an algorithm which relates a certain function 
tp(t), t e R, called a scaling function, to its scaled version <p(Nt) where N is a 
fixed integer, 2 or more, or an expansive integral d by d matrix A if t is in M. d . 

For d—1, the formula which determines f is 

(p (t) = N OfcV? {Nt - k) , teR. (1.1) 

kei 

It is called the scaling identity. Consider the filter function m{x) = dke~ l2 ' Kkx 

for z = exp(—i27rx) in the circle T, or the ci-torus T d . It is traditional to 
impose some degree of regularity, e.g., Lipschitz, or low-pass for x = 0; but 
we shall not do this. Instead we shall work with a probabilistic notion which 
serves as a minimal restriction on the filter m (x) guaranteeing the existence 
of L 2 solutions to the scaling identity (1.1). It is not at all clear a priori 
that this identity should have L 2 solutions, or even solutions that are given by 
some kind of convergent algorithm. The coefficients in (1.1) are called masking 
coefficients, filters, or filter coefficients. For special values of the coefficients, 
it turns out that there is a normalized L 2 solution <p, and that pointwise 
convergence of a good approximation can be established in a meaningful way. 
Moreover, we show that the convergence behavior is dictated by properties of 
an associated transfer operator R, or Ruelle operator. 

A main point in this paper is our suggestion of a different and more ver- 
satile approach to how we impose low-pass conditions on the basic wavelet 
filters. Our approach is probabilistic, and it is based directly on the Ruelle 
operator R = R w , and on a certain Perron-Frobenius eigenfunction h for R 
(see Theorem 5.2). Our work is motivated in part by recent considerations 
of frequency- localized wavelets, as pioneered in for example in the papers 
[2,4,5,10,11]. These papers suggest an interesting context for wavelets that 
are necessarily frequency-localized. But at the same time, it is evident from 
this work that the corresponding wavelet filters will then typically not satisfy 
the low-pass properties that are otherwise known to hold for more traditional 
time- (or space-) localized wavelets. 

The expression on the right-hand side in equation (1.1) is also called a subdivi- 
sion because of the function values ip(Nt — k). An iteration of the operations 
on the right hand side in (1.1) on some initial function is called a cascade 
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approximation. This approximation is one approach to the function ip, and 
the other is the infinite product formula (3.5) for the Fourier transform of (p. 

The relation between (p and its scaled refinement is well understood when we 
pass from the time domain to the frequency domain, via the Fourier transform. 
In that case the relation is multiplicative, and involves a certain periodic 
matrix function m, called the low-pass filter. For further details, see formulas 
(3.2) and (3.4) below. 

The study of the filters m is part of signal processing (see, e.g., [31,45]). But 
by a 'mathematical miracle', they have become one of the most useful tools 
in wavelet constructions; and at the same time, they have pointed to a host 
of exciting applications of wavelet mathematics; see, e.g., [27]. To get a path 
space measure for some of the wavelet problems, we use the quadrature mirror 
properties (see (2.3) below), or their generalizations, which are assumed for 
the filter function m, also called a frequency response function. It is periodic, 
in one or several variables. In R d , there is a variety of choices of a period lattice 
for the problems at hand. 

For a number of applications (see especially [4,5,10,11]), we must consider 
vector versions of the scaling identity (1.1); and in that case, the solution <p, 
the scaling function, will be viewed as a vector- valued function, i.e., a function 
from M d into some Hilbert space, typically finite-dimensional. In that case, the 
coefficients in (1.1) will be matrix- valued; and the product on the right- 
hand side in (1.1) will be matrix acting on vector. Following [4], we then 
say that the initial resolution subspace Vo in L 2 (IR d ) has multiplicity. In the 
more traditional multiresolution analysis (MRA) approach to wavelets, <p is a 
scalar function, and V is the closed span of the Z-translates of (p. The case 
of multiplicity is called the generalized MRA (GMRA) approach. The present 
considerations deal with the issue of passing from the filter function (possibly 
matrix-valued) to the scaling function (p. However, the formulation of our ideas 
in the scalar case may easily be modified to the matrix/vector case; and for 
the sake of simplicity, our technical discussion below will be presented in the 
scalar case. We leave to the reader the spelling out of the generalization to 
GMRAs. 

The fact that there are solutions in L 2 (M. d ) is not at all obvious; see [12]. In ap- 
plication to images, the subspace V (where V := cl span{ <p( ■ — k) \ k e TL d }) 
may represent a certain resolution, and hence there is a choice involved, but 
we know by standard theory, see, e.g., [12], that under apropriate conditions, 
such choices are possible. As a result there are extremely useful, and compu- 
tationally efficient, wavelet bases in L 2 (IR d ). A resolution subspace Vo within 
L 2 (R d ) can be chosen to be arbitrarily fine: Finer resolutions correspond to 
larger subspaces. 
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As noted for example in [8,18], a variant of the scaling equation is also used 
in computer graphics: there data is successively subdivided and the refined 
level of data is related to the previous level by prescribed masking coefficients. 
The latter coefficients in turn induce generating functions which are direct 
analogues of wavelet filters. 

One reason for the computational efficiency of wavelets lies in the fact that 
wavelet coefficients in wavelet expansions for functions in V may be computed 
using matrix iteration, rather than by a direct computation of inner products: 
the latter would involve integration over M d , and hence be computationally 
inefficient, if feasible at all. The deeper reason for why we can compute wavelet 
coefficients using matrix iteration is an important connection to the subband 
filtering method from signal/image processing involving digital filters, down- 
sampling and up-sampling. In this setting filters may be realized as functions 
m on a rf-torus, e.g., quadrature mirror filters. 

As emphasized for example in [34] and [9], because of down-sampling, the 
matrix iteration involved in the computation of wavelet coefficients involves 
so-called slanted Toeplitz matrices F from signal processing. The slanted ma- 
trices F are immediately available; they simply record the numbers (mask- 
ing coefficients) from the (^-scaling equation. These matrices further have the 
computationally attractive property that the iterated powers F k become su- 
cessively more sparse as k increases, i.e., the matrix representation of F k has 
mostly zeros, and the non-zero terms have an especially attractive geometric 
configuration. In fact subband signal processing yields a finite family, F, G, 
etc., of such slanted matrices; for example (with L for "low frequency" and H 
for "high frequency"): 



The wavelet coefficients at scaling level A; of a numerical signal s from Vo are 
then simply the coordinates of GF k s. By this we mean that a signal in Vo is 
represented by a vector s via a fixed choice of scaling function; see [12,8,31]. 

Given some choice of scaling function tp, then under suitable conditions (e.g., 
orthogonality or a priori frame estimates) we may define the operator W : Vo — * 
£ 2 which links the resolution subspace Vo with the sequence space £ 2 of signals 
s = (sfc) as follows: 



v? = ]TP fe v?(2 • -k), V = E<?^( 2 
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Here W becomes a well defined linear operator, W : V — * I 2 

Then the matrix product GF k is applied to s; and the matrices GF k get more 
slanted as k increases. 

Our approach begins with the observation that the computational feature of 
this engineering device can be said to begin with an endomorphism ta of the 
o?-torus T d = M. d /'Z d , an endomorphism which results from simply passing ma- 
trix multiplication by A on M. d to the quotient by Z d . It is then immediate that 
the inverse images r^i^x) are finite for all x in T d , in fact #r^ 1 (x) = | det A\. 
From this we recover the scaling identity, and we note that the wavelet scaling 
equation is a special case of a more general identity known in computational 
fractal theory and in symbolic dynamics [32]. We show that wavelet algo- 
rithms and harmonic analysis naturally generalize to affine iterated function 
systems. Moreover, in this general context, we are able to build the ambient 
Hilbert spaces for a variety of dynamical systems which arise from the iterated 
dynamics of endomorphisms of compact spaces [15]. 

As a consequence, the fact that the ambient Hilbert space in the traditional 
wavelet setting is the more familiar L 2 (M. d ) is merely an artifact of the choice of 
filters itiq. As we further show, by enlarging the class of admissible filters, there 
is a variety of other ambient Hilbert spaces possible with corresponding wavelet 
expansions: the most notable are those which arise from iterated function 
systems (IFS) of fractal type, for example for the middle-third Cantor set, 
and scaling by 3. 

With examples, with theorems, and with graphics, we hope to bring these 
threads to light in this little book: The journey from wavelets to fractals via 
signal processing. 

More generally, there is a variety of other natural dynamical settings (affine 
IFSs) that invite the same computational approach. 

The two most striking examples which admit such a harmonic analysis are 
perhaps complex dynamics and subshifts. Both will be worked out in detail. 
In the first case, consider a given rational function r(z) of one complex vari- 
able. We then get an endomorphism r acting on an associated Julia set X 
in the complex plane C as follows: This endomorphism r : X — > X results by 
restriction to X [6] . (Details: Recall that X is by definition the complement of 
the points in C where the sequence of iterations r n is a normal family. Specifi- 
cally, the Fatou set F of r(z) is the largest open set in C where r n is a normal 
sequence of functions, and we let X be the complement of F. Here r n denotes 
the n'th iteration of the rational function r(z).) The induced endomorphism r 
of X is then simply the restriction to X of r(z). If r then denotes the resulting 
endomorphism, r: X — > X, it is known [16] that ^r~ 1 (x) = degree of r, for 
every x in X (except for a finite set of singular points). 
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In the second case, for a particular one-sided subshift, we may take X as the 
corresponding state space, and again we have a naturally induced finite-to-one 
endomorphism of X of geometric and computational significance. 

But in the general framework, there is not a natural candidate for the ambient 
Hilbert space. That is good in one sense, as it means that the subband filters 
m which are feasible will constitute a richer family of functions on X. 

In all cases, the analysis is governed by a random-walk model with successive 
iterations where probabilities are assigned on the finite sets #r _1 (:r) and are 
given by the function W := |m | 2 . This leads to a transfer operator Rw which 
has features in common with the classical operator considered first by Perron 
and Frobenius for positive matrices, in particular it has a Perron-Frobenius 
eigenvalue, and positive Perron-Frobenius eigenvectors, one on the right, a 
function, and one on the left, a measure; see [42]. This Perron-Frobenius 
measure, also sometimes called the Ruelle measure, is an essential ingredient 
for our construction of an ambient Hilbert space. All of this, we show, applies 
to a variety of examples, and as we show, has the more traditional wavelet 
setup as a special case, in fact the special case when the Ruelle measure on 
T d is the Dirac mass corresponding to the point in T d (additive notation) 
representing zero frequency in the signal processing setup. 

There are two more ingredients entering in our construction of the ambient 
Hilbert space: a path-space measure governed by the H^-probabilities, and cer- 
tain finite cycles for the endomorphism r. For each x in X, we consider paths 
by infinite iterated tracing back with r _1 and recursively assigning probabili- 
ties with W. Hence we get a measure P x on a space of paths for each x. These 
measures are in turn integrated in x using the Ruelle measure on X. The 
resulting measure will now define the inner product in the ambient Hilbert 
space. 

Since the first question is to decide when tp is in L 2 , we iterate and get an 
infinite matrix product involving the matrix W := m*m. Since W is positive 
semidefinite, we may create a positive path measure of a random walk starting 
at x in some period interval. In several dimensions, x starts in a fundamental 
domain D for some fixed lattice, for example Z d . The paths starting at x arise 
by iteration of the inverse branches of x — > Ax mod Z d . There are N = | det A\ 
distinct branches. These N branches may be viewed as endomorphisms of D. 

In this paper, we will consider pointwise convergence of the infinite product 
(3.5) below for the Fourier transform <p of the scaling function ip. But the tradi- 
tional low-pass/regularity considerations for the filter m(x) are more general. 
Once pointwise convergence (Theorem 5.2) is established, then the quadrature 
property for m(x) will imply that the scaling function <p is automatically in 
L 2 (R). 
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We construct our random walks in a general framework which includes both 
wavelets, wavelet packets, and some of the other more classical problems. We 
further show how some of the classical questions may be phrased and solved 
with the use of path space measures. 

It is interesting to contrast our proposed approach with the more traditional 
one used in wavelet analysis: see, e.g., [12]. Traditionally, some kind of Lip- 
schitz or Dini regularity condition must be assumed for the filter. Then the 
corresponding infinite product may be made precise, and we can turn to the 
question of when the wavelet generators are in L 2 (R d ); see, e.g., [30] and [15]. 
As it turns out, both of these issues have natural formulations, and solutions, 
in terms of the path space measures. And the results allow a wider generality. 
In a variety of wavelet questions for band-limited wavelets, the regularity con- 
ditions just aren't satisfied for the filters m that are dictated by the setting 
and the applications; see, e.g., [3]. 



2 Path space 

A well tested tool in analysis, and in mathematical physics, centers around 
the application of path-space measures. This tool is used in attacking a vari- 
ety of singular convergence, or approximation, problems. We will adopt this 
viewpoint in our study of wavelet approximations. Traditionally, the setting 
for wavelet questions has included assumptions concerning continuity, or some 
kind of differentiability. In contrast, we shall work almost entirely in the mea- 
surable category. One advantage of our present approach is that we stay in the 
measurable category when addressing problems from multiresolution analysis 
(MRA). Earlier work on the use of probability in wavelets includes that of 
R.F. Gundy et al; see [14,17,19,21,22,23,24,25]. 

Our present viewpoint is more general than [14], and it starts with the random 
walks naturally associated with a measure space (X, B) and a given measurable 
onto map o: X — > X such that j^o~ x ({x}) = N for all x G X, where N, 
2 < N < oo, is fixed. Iteration of the branches 

a- 1 ({x}):={yeX\a(y)=x} (2.1) 

then yields a combinatorial tree. If 

u = (u 1 ,u 2 ,...)en:= {0,l,...,iV-l} N , 

an associated path may be thought of as an infinite extension of the finite 
walks 
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starting at x, where (Yj), i — 0, 1, . . . , N — 1, is a system of inverses, i.e., where 

ao Ti = id x , < % < N. (2.2) 
If W : X — > [0, 1 ] is a given measurable function such that 

Y,W(y) = l, (2.3) 

y. cr(y)=x 

then an associated measure P x on fl may be defined as follows. Suppose 
some function / e C (ft) depends only on a finite number of coordinates, 
say ui, ... , u n , then set 

I fdP x = J2 f K> ■■■,w n )W (t ui x) W (t^t^x) •■•W(r uln --- r ui x) . 

(ui,...,Un) 

(2.4) 

Extensions of this formula to fl can be done in a number of ways: see the cited 
references. 

A special feature of this construction, which will be explored in the present 
monograph, is that of attractive convergence properties for infinite products 
of the form 

I] W(t^x)---W(t^--- V ) (2.5) 

n,0Ji,...,u n 

over certain subsets of fl. As it turns out, these infinite products are deter- 
mined by the measures (P x ) xeX , and by the Ruelle transition operator 

(R w g) (x) := J2 W (V) 9 (v) , 9 G PO • (2-6) 

y: cr(y)=x 

The operator R in (2.6) is called the transition operator, the Ruelle operator, 
or the Ruelle-Perron-Frobenius operator, and it will play a major role in what 
follows. 

Many problems in dynamics are governed by transition probabilities W, and 
P x , and by an associated transition operator Rw as in (2.6). Wavelet theory 
is a case in point, and we show that fundamental convergence questions for 
wavelets, and properties of the solutions to (1.1), depend on the positive so- 
lutions h to the eigenvalue problem Rw(h) = h. In Theorem 5.2, we show 
that there is a particular solution h, see (3.11), which determines the issue 
of pointwise convergence of the infinite product. We refer to the discussion 
around (3. 11)— (3.12) below. The solutions h are called harmonic, and the 
function h in (3.11) is a special harmonic function which will play a central 
role in our main result, Theorem 5.2 below. 
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3 Multiresolutions 



The multiresolution approach to wavelets involves functions on R. It begins 
with the fixed-point problem 

up (t) = N a k (p {Nt - k) , teR, (3.1) 

where a given sequence (ak) keZ is chosen with special filtering properties, e.g., 
quadrature mirror filters; see [15,20,26,29,30,38,39,46]. The equation (3.1) is 
called the scaling identity, and the a^'s the response coefficients, or the masking 
coefficients. Introducing the Fourier series 

m(x) = Ya k e- l ^ kx (3.2) 

and Fourier transform 

$(x)= I e~ i27Vtx ip (t) dt, (3.3) 

we get the relation 

$(x)=m(j^<p(j£), xeR, (3.4) 
which suggests a closer inspection of the infinite products 

5 •»(£)■ < 3 - 5 > 

Since we shall want solutions tp to (3.1) which are in L 2 (R), (3.4)-(3.5) suggest 
the corresponding convergence questions for the function W :— \m\ . 

When (P x ) x€ ^ x j is the family of measures on Q corresponding to W = \m\ 2 , 
then the formal infinite product 

lv^)| 2 =ft^(^) (3.6) 

is P x ({(0, 0, 0, . . . )}), i.e., the measure of the singleton (0, 0, ... ) (an infinite 
string of zeroes) in {0, . . . , N — 1} N . Our main theorem (Theorem 5.2) gives 
a necessary and sufficient condition for the pointwise convergence of (3.6) in 
a rather general context. We note that because of assumption (2.3), (p will 
automatically be in L 2 (R), once pointwise convergence is established. There 
is a natural way (based on Euclid's algorithm) of embedding Z into 

n = {0, . . . , N - if x {0, . . . , N - if (3.7) 
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such that 

P x (Z) = '£\(p(x + k)\\ (3.8) 

Even though the measures P x (Section 6) are defined a priori on the over- 
countable probability space, the surprise is that they are in fact supported 
only on a fixed thin (countable) subset of Vt. 

Remark 3.1. Note that, in general, it is not at all clear that the measures 
(P x ), x G X, on Q should even have atoms. Typically, they don't! But if 
atoms exist, i.e., when there are points w G such that P x {{uj}) > 0, we 
note that this yields convergence of an associated infinite product. Let No : = 
{0, 1, 2, . . . } = NU {0}. Using Euclid, and the iV-adic expansion 

k = h + i 2 N + ■ ■ ■ + inN* 1 - 1 for k G N , (3.9) 

we see that the points 

uj(k) = (i u . . . , i n , 0, 0, 0, . . . ) 

oo string of zeroes 

represent a copy of No sitting in fl. With the identification k <-> uj (k), we set 

oo 

P x (N ) = '£P x ({u(k)}). (3.10) 

fc=0 

But in general, this function P x (N ) might be zero. Our first observation about 

h(x):=P x (N ), xeX, (3.11) 
is that it solves the eigenvalue problem 

R w h = h. (3.12) 

We say that h is a minimal harmonic function relative to Rw- Note that in 
general h — may happen! 

Remark 3.2. Let X, B, a, To, ... , Tat-i, and W be as described above, and 
let (P x ) xeX ^ e ^ ne corresponding transition probabilities. Let 

o = ( o, o, o, . . . ) g n = {0, 1, . . . , iV - 1} N . 

oo string of zeroes 

While in general, often P x ({0}) = 0, the case when P x ({0}) > is important. 
The condition P x ({0}) > is a way of making precise sense of the infinite 
product 

oo 

P*({0}) = UW(tSx). (3.13) 

n=l 
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If, for example, lim^oo W (tqx) < 1, then it is immediate from (3.13) that 
P x ({0}) = 0. 

Suppose P x ({0}) > 0. Then it follows that 

P x ({(h, . . . , i n , 0)}) = W (r h x) ...W(r in --- r H x) P Tin ... TilX ({0}) . (3.14) 

Using (3.9), we shall identify k G N with the point uo (k) G f2, and write 
Pa: ({A;}) for the expression in (3.14). 

An important question for dyadic wavelets in L 2 (R) is the issue of when 
these wavelets form orthonormal bases (ONB's). A dyadic wavelet function 
ip G L 2 (R) generates an ONB if the double-indexed family 

{ 2 n/2 ip (2 n t -k)\n,ke%} (3.15) 

satisfies (i) and (ii) below: 

(i) / ip n ,k (*) (t) dt = 5 n!m 5 k;l , with 

JR 

iP n , k (t):=2 n ^(2 n t-k), (3.16) 

and 

(ii) the closed linear span of { ij} n ^ \ n, k G Z } is L 2 (R). 
In our analysis of the scaling identity 

<p(t) = 2Y,ak<p(2t-k) (3.17) 

keZ 

(a special case of (3.1)), we will be looking at two functions if and ip; the 
second one may be taken to be 

V> (0 = 2 E ( 2t ' k ) ■ ( 3 - 18 ) 

kei 

This analysis is the approach to wavelets which goes under the name of mul- 
tiresolution analysis. The function ip which is used in (3.15)-(3.16) is the solu- 
tion to (3.18). The two standing conditions which are placed on the numbers 
{ a k)kezi called masking coefficients, are 

^kdk+2n = -<W, neZ, (3.19) 
kez 1 

and 

5> fc = l. (3.20) 

kei 
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These conditions in themselves do not imply orthonormality in (3.16), but 
only the following much weaker property: 

EE \(^k I f)f = ll/ll 2 = / 1/ (t)f dt, f e L 2 (R) . (3.21) 

n,k€Z JR 

A system of functions (i/) n ,k) satisfying (3.21) is called a Parseval frame, or a 
normalized tight frame. 

Given (3.19)-(3.20), it turns out that the ONB property for the wavelet is 
equivalent to either one of the following two conditions for the normalized 
scaling function ip: 

f^j<p(t-k) dt = 5 , k , (3.22) 

JR 

or 

IMIl2(r) = L ( 3 - 23 ) 



4 Sampling 

In this section and the next, we study an intriguing relationship between the 
following three problems: 

(1) When does the scaling identity (3.1) have L? solutions? 

(2) How may the transition probabilities P x be used in sampling certain 
functions at the points x + k as k runs over a set of integers? 

(3) When is the infinite product (3.5) pointwise convergent? 

In the wavelet applications, X = [ 0, 1 ], and the system a, To, ... , r A r_ 1 is as 
follows: 

a (x) = Nx mod 1, 

r 1 {x) = X -±t, 3 = 0,1,. ..,N-l. (4,1) 

Lemma 4.1. Setting F (x) := P x ({0}), and 

k = i 1 +i 2 N + --- + i n N n - 1 (GN ), (4.2) 

we get the formula 

P x ({k}) = F(x + k), (4.3) 
where we have identified k with the point uj (k) := (i 1; . . . , i n , 0) in Q. 

Proof. To see this, identify functions on [0,1] with 1-periodic functions on R, 
and note that the second formula in (4.1) yields T in - • • r h (x) = (x + k) /N n 
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where k is given by (4.2). Hence, if 1 < s < n, then 

W s ' ' • T - ^ = ^ ( J = ^ (^V* - J • 

It follows that (4.3) is really just a rewrite of (3.14). The right-hand side of 
(3.14) yields 

^)-" w (^M f ^) = F( * + fc) ' (44) 

which is the desired conclusion. □ 



Remark 4.2. To extend P x ( • ) from N to Z, recall that k G N is identified 
with the singleton u (k) = ...,i n , 0) via (4.2). Now, if -N n < k < 0, then 

set 

P x ({k}):=P x ({io(N n+1 + k)}). (4.5) 

To help the reader gain some intuitive feeling for the conclusion in Lemma 4.1, 
observe that the right-hand side of (4.3) represents a sampling of the function 
F at the integral translates on R, starting at x, i.e., x + k. Obviously, different 
subsets of Q would yield different sets of sampling points for F, including 
nonuniformly distributed sampling points; see [1]. 

Starting with Shannon [43], the theory of sampling has emerged as a significant 
tool in signal processing; see, e.g., the beautifully written survey [1] as well 
as the references cited therein. Thus, in a general context, our formula (4.3) 
offers a probabilistic prescription for sampling of functions on the real line, 
and at the same time it stresses the 'random' feature of sampling. 



5 A convergence theorem for infinite products 



We now show how this viewpoint from sampling theory is closely related to 
some fundamental properties of the measures P x . In particular, our Theorem 
5.2 gives a necessary and sufficient condition for pointwise convergence of the 
infinite product (3.5), or more generally (2.5), with the condition for conver- 
gence stated in terms of the harmonic function h of (3.11). The relationship 
between h and the measure family P x is studied more systematically below. 

Let A C Z. Returning to (4.3), we set 

P x (A) := £ P x ({k}) = £ F (x + k) . (5.1) 

keA k€A 
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As in Lemma 4.1, the number N, N > 2, and the function W are given. The 
measures P x are constructed from these data using (2.4), and we have the two 
functions 

F(x) :=P X ({( 0, 0, 0, ...)}) (5.2) 

S v ' 

oo string of zeroes 

and 

h (x) := P x (Z) , x E R. (5.3) 

Finally, for k eN, set 

N k Z := { N k j | j e Z } . (5.4) 
Using (4.5), we see that N k Z is represented in Q = {0, 1, . . . , N — 1} N as 

(0, 0, wi, u 2 , ., 0, 0, 0, ... ). (5.5) 

^^^^^^^^^^^ 

k zeroes a finite string oo string of zeroes 

of symbols 

wie{o,...,JV-i} 

An infinite string of zeroes will be denoted 0. 

Lemma 5.1. Let N, W , F , P x , and h be as described above; see (5.2)-(5.3). 
Then h satisfies the following cocycle identity: 

h (x) W (x) = P Nx (NZ) , xei (5.6) 

In particular, if h (x) = 1 a.e. x G K, then we recover the function W from the 
transition probabilities P x , a.e. i£l 

Proof. We calculate the left-hand side in (5.6), using the earlier equations: 

h(x)W(x) = W(x)J2F(x + j) 

b y ( 4 - 3 ) ,-p Z 

and (5.3) J 
by periodicity 

by (4.4) j& 

J2F(Nx + Nj) 

by (4.3) j& 

= P Nx (NZ) . 
by (5.1) 

This is the desired identity (5.6), and the proof is completed. □ 

Theorem 5.2. Let N, W, F, P x , and h be as described above. Let and 
suppose that P x ({0}) > 0. Then the following two conditions are equivalent. 
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(a) The limit on the right-hand side below exists, and 



n , 
F (x) = lim TT W 



n^oo 

k=l 



X 



(b) The limit on the left-hand side below exists, and 



lim h ( 1 = 1. 



Proof, (a) =>- (b). An iteration of the identity (5.6) in Lemma 5.1 above yields 
Using (5.5), and working in Q, we find 



p| N k Z = {0} . (5.8) 

ken 

An application of a standard result in measure theory [41, Theorem 1.19(e), 
p. 16] now yields existence of the following limit: 

lim P x (N k Z) = P x ({0}) = F (x) . (5.9) 

fe^oo V / by (5.2) 

Since (a) is assumed, and F (x) > 0, we conclude that the limit h (x/N k ), for 
k — > oo, must exist as well, and further that 

F( X ) = F( X ) limft(^). 

Using again F (x) > 0, we finally conclude that (b) holds. 

(b) =>- (a). Recall that formula (5.7) holds in general. If (b) is assumed, we 
then conclude that the limit IljLi W (x/N^) exists as k — > oo. The limit on 
the left-hand side in (5.7) exists and is F (x). As a result, we get 

k k 

F{x) = lim f[w(4A lim h(^) = lim TT W ( ), 

V ' fc-oo£* \N3 J k^oo \N k J using (b) fc-oo ^i J 

which is (a). □ 



6 Ruelle's wavelet transition operator 



The next definitions (Definitions 6.1) and the lemma (Lemma 6.2) give us the 
precise details behind the two critical notions from Theorem 5.2 above; i.e., 
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the cocycles, and the random walk measures P x . An existence and uniqueness 
theorem for P x is then presented in Section 8. 

Definitions 6.1. 

(a) The Ruelle operator R = Rw is defined by 

(Rf)(x)= £ W(y)f(y), x e X, / e L°° (X) , (6.1) 

yeX, cr(y)=x 

and maps L°° (X) into itself. 

(b) Let f2 be the compact Cartesian product 

oo 

n = Z^ = {0,...,N-l} N = H{0,...,N-l}. (6.2) 

i 

(c) A bounded measurable function V : X x Q — > C is said to be a cocycle if 

V (x, (wi, w 2 , ...))= V (r wi (x) , (u 2 , u> 3 ,... )) (6.3) 

for all uj = (ui,U2, • • • ) £ H. 

(d) A function /i: X — > C is said to be harmonic, or R w -harmonic, if 

= h. (6.4) 

(e) Let nGN, and let i\, . . . , % n e Zjv- Then the subset 

A . . . , i n ) := { w e Q | uj ± = ii, . . . , u n = i n } (6.5) 

is called a cylinder set. 

We shall use the following correspondence (see [33] and [17] for details and 
proofs) between the cocycles V from (c) and the harmonic functions h from 
(d): If V is given as in (c), then the function h(x) := P x [V(x, ■)} has the 
properties in (d). Conversely, for every h satisfying (d), including (6.4), there 
is a martingale limit which lets us recover a cocycle V, P x a.e., such that 
h(x) := P x [V(x, ■ )]. In the present discussion, we are interested in the cocycles 
V which arise as indicator functions Xs f° r certain cyclic and invariant subsets 
S of f2. We refer to Section 8 below. 

Existence of the measures P x 

The cylinder sets generate the topology of Q, and its Borel sigma-algebra. In 
determining Radon measures on Q, it is therefore convenient to first specify 
them on cylinder sets. This approach was initiated by Kolmogorov [35]; see 
also Nelson [40]. Recall that f2 is compact in the Tychonoff topology, and 
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that we may use the Stone- WeierstraB theorem on C (ft) = the algebra of all 
continuous functions on Q. 

Lemma 6.2. Let X , B, fi, a, r 0; . . . , r N _i, and W be given as described 
above. We make the following more restrictive assumption on W : 

W(y) = 1 a.e. x E X. (6.6) 

yeX,cr(y)=x 

Then for every x € X there is a unique positive Radon probability measure P x 
on Q such that 

P x (A(i 1 ,...,i n )) = W (r h x) W (r l2 r h x) ...W(r in --- r H x) . (6.7) 



The main fact about the Ruelle operator 

R W f{x)=Y,W(TiX)f(TiX) 

i 

is this [17]: Under suitable conditions on W, there is a unique probability 
measure (Ruelle measure u) satisfying 

v ■ R w = z/, 

and a unique continuous minimal eigenfunction h m i n , 

R\vhm'm h m i n . 

This function h min is minimal in the ordered convex set 

^heC(X) < h < 1, R w h = h, J x hdv = lj , 

and moreover 

h m in (x) = P x (Z) . 

Remark 6.3. It turns out that the general case J2 y • • • < 1 may be reduced 
to (6.6). So (6.6) is not really a restriction. 

Proof of Lemma 6.2. If P is a Radon measure on Q, we set 

P[f]:= J f(u) dP(u) for all / e C (ft) . (6.8) 

Set 

C fin (fl) = {/GC(fl) | 3 n such that / (w) = f (u u . . . , u n ) , 

i.e., / depends only on the first n coordinates in Q }. (6.9) 
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Increasing n in definition (6.9), we get an ascending nest of subalgebras of 

c(n), 

2li C 2l 2 C ••• C 2t„ C 2l n+ i C ••• , (6.10) 

with 

oo 

U On = C (ft) , 
n=l 

where " stands for norm- closure. We set 

oo 

C fin (ft) := |J 2l n . 

n=l 

An immediate application of Stone- Weierstrafi shows that C& n (ft) is uniformly 
dense in C (ft). Let x G X, and / G Cfi n (ft)- Suppose 

/M = /(wi,...,w n ), 

and set 

*U/]= E W(v)"-W r (^„---V)/(wi,...,Wn)- (6.H) 

(wi,...,uj„)eZ£ 

Note that, if there is some h G L°° (X) such that 

f(ui,...,u n ) = h [r Un ■ ■ ■ t Ui x) , 



]T ^M-^^.-t^M^-v) (6.12) 

(wi,...,w„) 

x: ^( ( 7 n - i y)---^(^)^(t / )/ i ( 2/ ) 

We now show that P x [ / ] is well-defined. This is the Kolmogorov consistency: 
we must check that the number P x [ f ] is the same when some / G 2l n (c 2l n +i) 
is viewed also as an element in 2l n+ i. Then 

f(u) = /(wi,...,w n ) 

— / (<^1> • • • ) Wjh ^n+l) , 



then 

^[/] = 
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and 

Px [ fn+1 }= W (V 1 ) '••W ( r ^„+i • • • T m x ) f (^1, • • • , Wn+i) 

wi,...,u n +i 

= J2w( v )---w(T Wn ... v ) 

wi,...,w„ 




=1 by (6.6) 



= W ( T ^l X ) ' ' ' W ( T "n ' ' ' T "1 X ) f ( U U ■ ■ ■ , W„) 

wi,...,o;„ 
= -fx [ /n ] 5 

as claimed. 

The consistency conditions may be stated differently in terms of conditional 
probabilities: for / G C (f2), set 

^j n) [/]=^[/|2tn] (6.13) 

= Yl W (V 1 ) • • • W (^n • • • T^X) f (Wl, ...,W„). 

(wi,...,w„) 

We proved that 

pW[ / ] = p(«+D[ / ] forall/G2l„. 

Using now the theorems of Stone- WeierstraB and Riesz, we get the existence of 
the measure P x on Q. It is clear that it has the desired properties. In particular, 
the property (6.7) results from applying (6.11) to the function 

f(u) := 5 iim ■■■5 inj0Jn , ujett, (6.14) 

when the point (ii, . . . , i n ) is fixed. 

These functions, in turn, span a dense subalgebra in C (Q) (by Stone-Weier- 

straB), so P x is determined uniquely by (6.7). □ 



7 Transition measures 

We now compute the Markov transition measures P x , as x varies over the set 
X . The construction of P x , and the probability space Q, begins with a sequence 
of measures Pj n \ n = 1,2,..., corresponding to finite paths of length n. 
Then we show that the infinite path-space measure, Lemma 6.2, results from 
an application of the Kolmogorov extension principle. Our technical analysis 
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involves a certain transition operator R which generalizes Lawton's wavelet 
transition operator; see [36,37]. 

We already mentiond how the measures P x serve to prescribe sampling of func- 
tions on the real line, Lemma 4.1. However, a deeper understanding of this 
sampling viewpoint is facilitated by the introduction of the Perron-Frobenius- 
Ruelle operator R, Definitions 6.1, and an associated family of harmonic func- 
tions. For these harmonic functions, we note in Section 6 that there is a crucial 
analogue from classical harmonic analysis of Fatou-boundary function in the 
present discrete context. As noted after Definition 6.1 below, the boundary 
value functions take the form of certain cocycles, and the function h from 
(3.11) corresponds to a special No-cocycle. The existence of these cocycles is 
based on a martingale convergence theorem; see [33, Theorem 2.7.1] and [17] 
for additional details. 

A fundamental tightness condition for the random walk model is introduced. 
The transition probabilities P x live in a universal probability space Q, but 
the essential convergence questions for the infinite products depend on the 
the P^'s being supported on a certain copy of No (the natural numbers), or 
of Z (the integers). This refers to the natural embedded of Z in which we 
described above. 



8 Kolmogorov's consistency condition 

For general reference, we now make explicit the extension principle of Kol- 
mogorov [35] in its function-theoretic form. 

Lemma 8.1. (Kolmogorov) Let N > 2 be fixed, and let 

Q= {0,1,...,N -1} N . 

For n = 1, 2, . . . , let 

P (n) : 2l n -> C 

be a sequence of linear junctionals such that (i)-(iii) hold: 

(i) p( n ) [ II ] = 1, where II denotes the constant function 1 on Q, 

(ii) f E «„, / > pointwise =>• P [n) [f] > 0, 

and 

(tit) PW [/] = P(" +1 ) if] for all f e 2l n . 
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Then there is a unique Borel probability measure PonVt such that 



P[f]=P [n) [f], 



(8.1) 



Specifically, for P, we have the implication 



f G C (ft) , f > pointwise 



P[f]>0. 



(8.2) 



Remark 8.2. Here we have identified positive linear functionals P on C (Vt) 
with the corresponding Radon measures P on Q, i.e., 



This identification P <-> P is based on an implicit application of Riesz's theo- 
rem; see [41, Chapter 1]. 

Proof of Lemma 8.1 . The proof of Kolmogorov's extension result may be given 
several forms, but we note that the argument we used above (in a special case), 
based on an application of the Stone-WeierstraB theorem, also works in gen- 
eral. □ 



The probability space f2 

We note that the probability space Q itself carries mappings o and Tj, % — 
0, 1, . . . , N — 1. Stressing the ^-dependence, we write 



for uj = (u>i, u>2, • • • ) G fl 

Remark 8.3. The connection between the cylinder sets in (6.5) and the iter- 
ated function systems (IFS) (X, a, To, ... , rjv-i) may be spelled out as follows: 
the cylinder sets in Q generate the sigma-algebra of measurable subsets of 
Q, and similarly the subsets r ix • ■ ■ r in (X) C X generate a sigma-algebra of 
measurable subsets of X. When nothing further is specified, these will be the 
sigma-algebras which we refer to when discussing the measurable functions on 
Q and X. In particular, we will denote by M (Q) and M (X) the respective 
algebras of all bounded measurable functions on Q, respectively X. 




(8.3) 



a (u) = (uj 2 ,uj 3 , . . . ) and t { (u) = (i, u u u 2 , ■ ■ ■ ) 



(8.4) 



Note that if X = [ 0, 1 ], and if 



x + i 



< % < N - 1, 



TiX = 



N 
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then we recover the familiar iV-adic subintervals: 



rii • • • n n (X) 



(8.5) 



Lemma 8.4. There is a unique mapping p: M (fl) — > M (X) which satisfies 

p(fg) = p(f)p(g) (8.6) 

and 

P (xA(ii,..,i„)) = X Til -T in (A-)- ( 8 - 7 ) 

TTie mapping p is an isomorphism of M (fl) onto M (X) . 

Proof. Recalling (6.14), we note that 

XA{n,..., in ) M = S iim ■ ■ ■ 5 iM ueQ. 

As a result, 

XA(ii,...,i n )XA(j 1 ,...,j n ) = ' ' ' din,jnXA(h, ...,?„)■ (8-8) 
We then define p first on 2l„ by 



Ti„(-X-)> 



P I X! a u,-,»nXA(ii,...,i n ) I — X! a h,:.,inX Til --- 

where a^,...,^ G C, and note that (8.6) is satisfied. 

It is easy to check that the extension of p from 2l n to 2l n +i is consistent. The 
final extension from \J n 2l n to M (ft) is done by Kolmogorov's lemma, and it 
can be checked that p has the properties stated in the conclusion of the present 
lemma. □ 

Theorem 8.5. Let X, W, and N be as described in the beginning of this 
chapter, and let { P x \ x G X } be the process obtained in the conclusion of 
Lemma 6.2. Then 

JV-l 

]T W (r iX ) P TiX [ f (i, ■ ) ] = P x [ f ] for all / e C (fi) . (8.9) 

i=0 

Moreover, equation (8.9) determines (P x ) uniquely. 

Remark 8.6. Stated informally, formula (8.9) is an assertion about the ran- 
dom walk: it says that if the walk starts at x, then with probability one, it 
makes a transition to one of the N points t (x), . . . , rjv-i (x). The probability 
of the move x i— > r iX is W fax). Recall (6.6) asserts that J2i W fax) = 1. 
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Proof of Theorem 8.5. It follows from (6.11) and the arguments in the proof 
of Lemma 6.2 that it is enough to verify (8.9) for / e C^ n (Q), or for / e 2l n . 
Let / e 2l n . Then 

i 

= H ^ ( r ^) W CvTjz) • • • VF {r Un • • • r^Tjx) / (i, u u . . . , u n ) 

i u)i,...,aj„ 
by (6.11) 

which is the desired conclusion. Recall Cfi n (Q) is norm-dense in C (Q). 

Note that the formula (8.9) generalizes the familiar notion of selfsimilarity for 
measures introduced by Hutchinson in [28]. In fact, (8.9) may be restated as 

N-i _ L 

]T W (nx) P TiX o (rf ) =P X . (8.10) 

i=0 

The uniqueness part of the theorem follows then from Lemma 8.1. □ 
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